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The human INGl gene encodes nuclear protein 
p^g/TVGj^ previously shown to cooperate with p53 in cell 
growth control (Garkavtsev, I., Grigorian, I. A., Oss- 
ovskava, V. S*, Chernov, M. V., Chumakov, P, M,, and 
Gudkov, A. V. (1998) Nature 391, 295-298). p33'^^^^ be- 
longs to a small family of proteins from hvmian, mouse, 
and yeast of approximately the same size that show 
significant similarity to one another within the C-termi- 
nal PHD finger domain and also Qontain an additional 
N-terminal region with subtle but reliably detectable 
sequence conservation. Mouse ingl is transcribed from 
three differently regulated promoters localized within a 
4-kilobase pair region of genomic DNA. The resulting 
transcripts share a long common region encoded by a 
common exon and differ in their 5'-exon sequences. Two 
transcripts are translated into the same protein of 185 
amino acids, the mouse equivalent of the human 
p^g/iVGj^ while the third transcript encodes a longer pro- 
tein that has 94 additional N-terminal amino acids. 
Overexpression of the longer protein interferes with the 
accumulation of p53 protein and activation of p53-re- 
sponsive promoters after DNA damage. Between the two 
products of ingjy only the longer one forms a complex 
with p53 detectable by immunoprecipitation. These re- 
sults indicate that a single gene, ingl, encodes both p53- 
suppressing and p53-activating proteins that are regu- 
lated by alternative promoters. 



The INGl gene was identified as a result of a functional 
screening of genes, the suppression of which is associated with 
neoplastic tramsfonnation (1). Inhibition of INGl expression by 
antisense RNA promotes anchorage-independent growth in 
mouse breast epithelial cells, increases the frequency of focus 
formation in NIH 3T3 cells, and prolongs the life span of diploid 
human fibroblasts in culture. INGl expression is up-regulated 
in senescent human fibroblasts (2), and ectopic expression of 
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INGl cDNA leads to arrest or promotes apoptosis in several 
cell types (3). The accumulated observations indicating INGl 
participation in the negative regulation of cell proliferation, 
control of cellular aging, and apoptosis have defined INGl as a 
candidate tumor suppressor gene. 

We have recently foimd that the biological effects of INGl 
and p53 are interrelated and require the activity of both genes. 
Neither of the two genes can, on its own, cause growth inhibi- 
tion when the other one is suppressed (4). Furthermore, acti- 
vation of transcription from the p21/WAFl promoter, a key 
mechanism of p53-mediated growth control, depends on the 
expression of INGl. A physical association between p33^^^^ 
and p53 proteins is detected by immunoprecipitation. These 
results indicated that p33^^'^^ is a component of the p53 sig- 
naling pathway and that p33^^'^^ cooperates with p53 in neg- 
ative regulation of cell proliferation by modulating p53-depend- 
ent transcriptional activation. 

Despite the apparent importance of INGl in the control of 
cell proliferation, our knowledge of expression, regulation, and 
function of this gene remains incomplete. Moreover, Gen- 
Bank"^^* contains two INGl mRNA sequences differing in their 
5 '-ends. The origin of these differences is unknown and re- 
quires explanation. So far, all of the information about the 
function of p33^'^'^^ was obtained fi"om in vitro experiments 
that involved ectopic expression of INGl cDNA or its suppres- 
sion by antisense RNA. The analysis of the structure of the 
INGl gene and its regulation in vivo are essential steps toward 
the understanding of its function and involvement in develop- 
mental and physiological processes. This is particularly impor- 
tant due to the cooperation between INGl and p53, which 
suggests that the functioning of the p53 signaling pathway 
could be dependent on the regulation of INGl expression. 

In the present vs^ork, we used the mouse ortholog of the 
human INGl gene for detailed structural and expression stud- 
ies, with the goal of subsequently utilizing it as a system for 
extensive genetic analysis, ingl was found to be a highly evo- 
lutionarily conserved gene with complex regulation, which in- 
volves generation of alternative transcripts initiated from dif- 
ferent promoters and translated into proteins that differ in 
structure and expression patterns. Moreover, these proteins 
have opposite effects on p53 -regulated transcription, indicating 
that ingl encodes two products v^ath the properties of a puta- 
tive tumor suppressor and a putative oncogene. 

MATERIALS AND METHODS 
Plasmids and. Libraries — Retroviral vector pLXSN, used for the in- 
troduction of mouse and human INGl cDNA in NMuMG, 10(1), and 
ConA cells, was provided by A. Dusty Miller (5). Retroviral vector 
pLXIG was constructed on the basis of pLXSN vector by substituting 
SV40 promoter and neo sequences with the internal ribosome entr>' site 
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of encephalo myocarditis virus (6) followed by the enhanced green fluo- 
rescent protein sequence (7). This vector permits us to translate both 
the gene of interest (cloned upstream of the internal ribosome entry 
site) and enhanced green fluorescent protein from a single bicistronic 
mRNA transcribed from Moloney murine leukemia \nrus long terminal 
repeat promoter. A cDNA library from senescent mouse embryonic 
fibroblasts was constructed using Superscript system (Life Technolo- 
gies, Inc.) according to the manufacturer's protocol and cloned into the 
A phage A gt22A vector. Stratagene's 129SVJ mouse genomic library 
cloned into the A FIX 11 vector was used for the isolation of the mouse 
genomic, -containing clones. 

Cell Lines — (Cultures of mouse embryonic fibroblasts were obtained 
from 10-day-old embryos. 10(1) cell line, a derivative of Balb/c 3T3 cells 
that spontaneously deleted both p53 alleles (8), was kindly provided by 
Arnold Levine. Pseudonormal mouse mammary gland epithelial cell 
line (NMuMG) was obtained from the ATGC collection. Ecotropic ret- 
roviral packaging cell line BOSC23 (9) was kindly provided by Warren 
Pear and David Baltimore (Massachusetts Institute of Technology). The 
ConA cell Hne, a derivative of BALB/c 3T3 cells 12-1 (8) with wild type 
p53, was described earlier (10). It carries the lacZ gene encoding jS-ga- 
lactosidase Escherichia coll under the control of the p53-dependent 
promoter and therefore allows monitoring p53 transcriptional activa- 
tion by a routine X-gal staining. All cells were maintained in Dulbecco's 
modified Eagle's medium containing 10% fetal bovine serum supple- 
mented with penicillin and streptomycin. For the serum starvation 
experiments, 10(1) and NMuMG cells were kept with 0.5% fetal bo\'ine 
serum for 48 h (NMuMG cells) or 36 h (10(1) cells). For the contact 
inhibition experiments, cells cultures were used 48 h (NMuMG) or 36 h 
(10(1)) after they became confluent. NMuMG and 10(1) cells were 
irradiated with 10 grays of y-radiation and used 24 h after treatment. 
Mouse embryo fibroblasts were propagated until they reached senes- 
cence. Populations of ConA cells, expressing different levels of p37^'^*^^, 
were generated by transduction with pLXIG vector, carrying p37'^'*-" 
cDNA, followed by fluorescence-activated cell sorting of cells witli dif- 
ferent levels of fluorescence. 

Animals — Organs and embryos of FVB/N mice were used for the 
RNA isolation and preparation of whole body sections of embryos using 
a cryostatic microtome for histoblot hybridization. 

Hybridization Screening of cDNA and Genomic Libraries — cDNA 
and genomic library screenings were done according to the standard 
protocols (11) using ^^P-labeled human and mouse ingl cDNA probes, 
respectively. 

5'- and 3' -RACE ^ — Alternative 5 '-ends of the mouse ingl were iso- 
lated from mouse spleen and mouse brain cDNAs using the Marathon- 
ready cDNA kit (CLONTECH), according to the protocol suggested by 
the manufacturer. API adaptor-specific sense primer, provided by 
CLONTECH, and the ingl -specific antisense primer (5'-CCATCT- 
GACTCACGATCTGGATCTTC-3') were used for PGR. Nested PGR was 
performed using AP2 adaptor-specific sense primer, provided by CLON- 
TECH, and in^7-specific antisense primer (5'-CTGCGGATCAG<5GC- 
CCTCTGGATGC-3'). Precise determination of the 5'- and 3 '-ends of the 
mouse i.ngl transcripts was done using the Marathon-2 cDNA amplifi- 
cation Kit based on the new SMART PGR cDNA s>Tithesis technology 
(CLONTECH). It is based on the recently identified ability of Moloney 
murine leukemia virus reverse transcriptase to add several nucleotides 
to the 3' terminus of first-strand cDNA during the reverse transcriptase 
reaction (12). Briefly, when reverse transcriptase reaches the 5 '-end of 
the mRNA, it switches templates and continues sjTithesizing the 
SMART template-switching oligonucleotide. The resulting single- 
stranded cDNA contains the complete 5 '-end of the mRNA as well as 
the sequence complementary to the template-switching oligonucleotide 
and is then selectively amplified by PGR. In these experiments, poly(A) 
RNA preparations isolated from the thymus and testis were used. The 
following sequences were used for the synthesis of antisense ingl- 
specific primers: 5'-AGGTGTGGT(X>GATCGGCAACGC-3' (forisoform 
la), 5'-CGGCXJGGAGCCAGAGCAGAGAAGGT-3' (isoform Ic), and 5'- 
GGGGTGGCCTGTGATTGTCGOTG-3' (isoform lb). tn^2-specific sense 
primer 5'-GGGTGCTTCTT(jGTACCAT-3' was used for the PGR ampli- 
fication of the 3'-end, 

Sequence Analysis — Sequencing was done using a Sequenase version 
2.0 DNA sequencing kit (U.S. Biochemical Corp.) or done by the Uni- 
versity of Chicago Cancer Research Center DNA Sequencing Facility. 
In all cases, both strands were read using multiple vector-specific or 



^ The abbre\aations used are: RACE, rapid amplification of cDNA 
end.s; PGR, polymerase chain reaction; X-gal, 5-bromo-4-chloro-3-indo- 
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gene-specific primers. Protein sequence data base searches were per- 
formed using the gapped BLASTP program and the PSI -BLAST pro- 
gram that iterates the search using profiles constructed from BLAST 
hits as queries for subsequent iterations (13). Multiple sequence align- 
ments were constructed using the Gibbs sampUng option of the MA- 
CAW program (14, 15). GenBank*^ accession numbers for the de- 
scribed sequences are AF177753-AF177757. 

Southern, Northern, and Western analyses were done according to 
standard protocols. IgGl mouse monoclonal antibody against human 
recombinant p33^^^^ (16) was used for the detection of mg7 -encoded 
proteins. Anti-p53 monoclonal mouse IgG antibody Ab-1 was obtained 
from Calbiochem. p21 rabbit polyclonal IgG antibody was obtained from 
Santa Cruz Biotechnology, Inc. (Santa Cruz, CA). 

Immunoprecopitation Analysis — Immunoprecipitation of p53 was 
carried out using 1 jiLg of DO-1 (Santa Cruz Biotechnolog>0 and 1 ixg of 
Ab-1 (Calbiochem) antibodies per 0.5 ml of cell lysate. Cell lysate was 
obtained by transient transfection of the corresponding vectors by Li- 
pofectAMINE Plus (Life Technologies, Inc.) into p53-negative Saos-2 
cells. Each transfected plate of cells was lysed in 0.5 ml of modified 
radioimmune precipitation buffer (25 mil Tris, pH 7.4, 125 mM NaCl, 
1% Nonidet P-40, 0.5% sodium deoxycholate with protease inhibitor). 
Sepharose-protein A (Amersham Pharmacia Biotech) was used to pull 
out the antibody-protein complexes, and pellets were washed four times 
with modified radioimmune precipitation buffer with 0.5% Nonidet 
P-40. PeUets were boiled in SDS-loading dye and run on 10% SDS- 
polyacrylamide gel electrophoresis. Western blot was carried out with 
anti-HA-biotin antibodies, with clone 12CAS (Roche Molecular Bio- 
chemicals), or with anti-p53 biotinylated antibodies (Roche Molecular 
Biochemicals) according to standard protocols. 

In Situ Hybridization on Nitrocellulose CHistoblottingV — Whole 
body sections, 20 ;xm thick, were prepared from frozen embryos embed- 
ded in blocks of Tissue-Tek O.C.T. and stored at -70 °C. Sections were 
placed on nitrocellulose (Schleicher & Schuell) to prepare histoblots as 
described previously (17). Histoblots were hybridized with ^"^^P-labeled 
RNA probes synthesized using Ambion's IVLAXIscript in vitro transcrip- 
tion kit. i;i^7 -specific sense and antisense probes were synthesized on 
the pBLUESCRIPT plasmid with the fragment of ingl cDNA corre- 
sponding to the PHD finger domain using T3 poljmierase for the anti- 
sense and T7 polymerase for the sense probe. ^-Actin antisense RNA 
was synthesized using the template provided by Ambion. Histoblots 
were incubated in prehybridization solution (0.75 mg/ml yeast tRNA, 
0.75 mg/ml poly(A), 50% formamide, 0.3 m Tris, pH 8.0, 1 mM EDTA, 5x 
Denhardt's solution, 10% dextran sulfate, 10 mM dithiothreitol) at 42 "C 
for 1-4 h. Hybridization w^as carried out for 12-24 h at 42 "C in the 
same solution containing 1-5 x 10" cpm of probe/ml. After hj^bridiza- 
tion, histoblots were washed in 4x SSC at room temperature for 20 min, 
in 2x SSC, 0.04 ^ml RNase A at 37 ^'C for 30 min, in 2x SSC at 37 **C 
for 30 min, in Ix SSC at 60 '^C for 15-30 min, and finally in 0.1 X SSC 
at 60 °C for 15-30 min and exposed to x-ray film for 2-7 days. 

RESULTS 

Multiple Transcripts of the Mouse ingl Gene Differ in Their 
5' -End Sequences — In order to isolate the mouse ortholog of the 
INGl gene, we screened a cDNA librarj' prepared from senes- 
cent mouse embryonic fibroblasts, using human INGl as a 
probe. The choice of the librar>^ was determined by the fact that 
INGl is expressed at higher levels in senescent, compared with 
normal, human fibroblasts (2). As a result, several clones were 
isolated, and the two longest were sequenced. The clones were 
identical to each other and highly similar to human INGl 
through most of their length, except for the 5 '-ends, which were 
different and not homologous to the human gene (Fig. lA). This 
observation could be an indication of alternative splicing of 
mouse ingl; however, it could also be potentially explained by 
cloning artifacts that occurred during the cDNA library prep- 
aration. To determine the structure of 5 '-end sequences of ingl, 
we used a 5' -RACE technique for the isolation of cDNA se- 
quences corresponding to the 5 '-termini of ingl mRNA. cDNA 
was synthesized from mouse spleen and brain mRNA and 
ligated to sjTcithetic adaptors. The cDNA was amplified by PGR 
using a sense primer specific for the adaptor and an antisense 
primer specific for the common part of ingl located close to the 
divergent region. Two fragments of different size were subse- 
quently cloned, sequenced, and compared with the clones iso- 
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Fig. 1. Cloning of different variants of mouse ingl cDNA and identification of their corresponding mRNA transcripts. A, schematic 
alignment of the mouse ingl cDNA clones and 5 '-RACE products with human INGl cDNA la and lb represent mouse ingl cDNAs isolated from 
the library of senescent mouse embryonic fibroblasts. A 5 '-RACE reaction gave two products, one identical to lb and another shown as Ic. The 
position of the gene-specific PGR primers used for the RACE reaction is indicated. Mouse ingl sequences are aligned with human INGl cDNA The 
open reading firame for the human INGl clone is indicated. B, analysis of ingl transcription by Northern hybridization, ingl expression in mouse 
liver, heart, and testis was analyzed using a multiple- tissue Northern blot (CLONTECH Laboratories), which was hybridized vtnth the probe 
corresponding to the common part of ingl {Lane 11 the 5'-end of isoform la (lane 2), the 5 '-end of isoform lb (lane 3), or the 5'-end of isoform Ic 
(lane 4). Probes were obtained by PGR using ingl -specific primers. An arrow shows the position of the RNA marker, 2.37 kilobases in size. 



lated from the cDNA library. The results of this comparison are 
schematically presented in Fig. lA. One of the 5 '-RACE prod- 
ucts was identical to one of the previously isolated cDNA 
clones. Another product revealed the third variant of ingl 
cDNA, which again consisted of common £ind unique regions, 
with the junction located at exactly the same position as in the 
other sequences. Fig. lA shows schematic alignment of the 
isolated mouse ingl cDNA clones and the 5 '-RACE products. 
All variants are identical to each other (and homologous to 
human INGl) up to the same nucleotide and differ only in their 
5 '-ends. 

In order to verify that the observed cDNA structure reflects 
naturally existing transcripts, we analyzed ingl mRNA species 
by Northern hybridization using probes that represent common 
or specific regions of the isolated cDNAs. As showTi in Fig. IB, 
the probe for the common region revealed multiple transcripts 
in RNA isolated from mouse liver, heart, and testis. Probes 
specific for individual transcripts, however, showed more sim- 
ple hybridization patterns, which in combination covered the 
whole set of transcripts found by hybridization with the com- 
mon probe. These results indicated that the cloned sequences 
represent the majority of multiple transcripts of ingl synthe- 
sized in the thymus and spleen. 

Mapping of Coding Regions of ingl in Mouse Genomic 
DNA — Comparison of sequences of individual cDNA clones 
showed differences in their 5' regions, suggesting that ingl has 
multiple alternative 5'-exons (Fig. 1). To verify this hypothesis, 
we determined the structure of the mouse ingl gene. We iso- 
lated phage clones carr^'ing sequences homologous to the 
mouse ingl cDNA by hybridization screening of a mouse 
genomic library. These clones were mapped by restriction di- 
gestion analysis, in combination with Southern blot hybridiza- 
tion, with the probes corresponding to the different ingl parts. 
Tlie interpretation of the results obtained is shown in Fig. 2A. 

Comparison of genomic and cDNA sequences of ingl using 
PGR with different m^^i -specific primers (data not shown) as 
well as with the sequencing data showed that most of the 
transcribed sequences of the ijigl gene come from a single exon. 
Alternative 5 '-ends are encoded by different exons positioned 
upstream from the common exon. Comparison of the sequences 
of cDNA and genomic clones, as well as 5 '-RACE products. 



reveeded three isoforms of mouse ingl that differ from each 
other only in their 5 '-ends, which also indicates that each 
isoform is most probably expressed from its own promoter. 
Southern blot hybridization analysis of m^l -related sequences 
in the mouse genome indicated that ingl is a single gene v^dth 
no obvious close family members (data not shown). 

The length of the isolated cDNA clones appeared to be sig- 
nificantly shorter than that of the mRNA species detected by 
Northern hybridization (Fig. IB), suggesting that part of the 
transcribed sequences were missing from the isolated cDNAs. 
To determine the exact start sites of ingl transcription, we 
used a new procedure called "SlVlART-based 5 '-RACE'' as de- 
scribed under "Materi£ds and Methods." Using this method, we 
were able to extend the cDNA sequences for the 5 '-end of 
isoform lb. This exon contains an extremely G/C-rich region 
that blocked DNA elongation during the original 5 '-RACE re- 
action (Figs. 2B and 3) and complicated sequencing of the final, 
long 5 '-RACE product. Therefore, the transcription start site of 
isoform lb was not precisely identified, since it was estimated 
based on the size of the 5 '-RACE product and the analysis of 
the genomic sequence. 

Comparing the 3'-ends of human and mouse INGl cDNAs 
showed that although the mouse transcript was flanked by a 
poly(A) stretch, its 3 '-untranslated region was significantly 
shorter than the human one. Analysis of the genomic clone 
with, the mouse ingl sequence revealed the presence of a long 
poly(A) stretch that could potentially^ be used as a primer- 
binding sequence for reverse transcription initiated from oli- 
go(dT) primers (Fig. 3C). Moreover, the alignment with the 
3 '-end of the human INGl cDNA sequence resumes down- 
stream from this genomic poly(A) stretch. The exact end of ingl 
transcription in mice was determined using the same method 
applied for the generation of 5 '-sequences. Mouse ijigl cDNA 
was, in fact, foimd to be longer than was originally thought 
(Figs, 2B and 3). Sequence comparison showed that mouse and 
human INGl transcripts terminate at the same point and 
share significant levels of similarity up to the very 3 '-ends. 

Sequence Analysis of ingl — Nucleotide and predicted amino 
acid sequences of the three mouse ingl cDNA isoforms and the 
promoter regions for all three alternative transcripts are shown 
in Fig. 3, It was found that ingl isoforms la £uid Ic have several 
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Fig. 2. Mapping of mRNA-coding 
regions within the mouse ingl locus. 

A, map of the mouse genomic region con- 
taining ingl cDNA-related sequences. 
Tiiree -positive genomic clones were 
isolated from the mouse genomic library 
using ingl cDNA as a probe (G2, G9, and 
GlO), and analyzed by restriction diges- 
tion and Southern blot hybridization. 
This analysis, in combination with the 
PGR data and with partial sequencing of 
the genomic clones, allowed us to deter- 
mine the positions of the ingl exons. B, 
determination of transcription initiation 
and termination sites in the ingl gene. 
The upper panel shows the genomic map 
with the positions of the ingl exons and 
the structure of ingl cDNA clones as de- 
termined after cDNA library screening 
and 5'-RACE analysis. The structure of 
two known human INGl cDNA isoforms 
in relation to mouse ingl sequences is 
shown above the genomic DNA. The lower 
panel demonstrates the final structure of 
the ingl gene determined as a result of 
precise mapping of transcription initia- 
tion and termination sites. In the upper 
panel, the position of the G/C-rich region 
in the 5 '-end of clone lb and the poly(A) 
region in the 3' part of the ingl tran- 
scripts are indicated, which interfered 
with the polymerase chain reaction dur- 
ing the original 5'-RACE. 
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tightly clustered transcription start sites. Sequences upstream 
of the initiation sites (putative promoters) lack TATA boxes, 
but they do contain a sequence corresponding to the loose 
initiator consensus PyP>^AN(T/A)PyP>^ that includes the tran- 
scription start sites (Fig. 3). In both cases, areas upstream from 
the initiator are extremely (}C-rich with multiple Spl-binding 
sites. All of these features are typical of TATA-less promoters 
(18, 19). Binding sites of inducible factors that are usually 
present further upstream in the promoter area (e.^. CAAT box, 
Oct-1 and Oct-2 binding site, NF-kB, or ATF binding site) were 
not found in these promoters. Transcription of ingl isoform lb 
is estimated to start approximately 30 nucleotides downstream 
from the TATA-like box. A putative CAAT box is present 60 
nucleotides upstream from the TATA-like sequence. Since the 
transcription initiation sites for isoforms la and lb are only 
about 200 base pairs apsirt, there is a possibility that their 
promoters share some of the regulatory sequences including, 
for example, a number of Spl binding sites located upstream of 
this area. 

All three alternative transcripts of mouse ingl contain the 
same long open reading frame, although the sizes of the pre- 
dicted proteins are different. While isoform lb encodes a pro- 
tein of 279 amino acids, the other two isoforms are predicted to 
encode a shorter protein product of 185 amino acids, which 
lacks 94 N-terminal amino acids. Translation of these two 
products is expected to start from the initiation codon, which is 
located at the beginning of the common exon. Alignment of the 
predicted amino acid sequences with that of human p33^''^'^^ 
(GenBank™ accession number AF044076) revealed high sim- 
ilarity between the mouse and human proteins (89% sequence 
identity) (Fig. 4). 

The nonredundant protein sequence data base at NCBI was 
searched using the i/i^i -encoded protein sequence as a query, 



and a highly significant sequence similarity was detected with 
a human paralog of p33^^'^^. These included three uncharac- 
terized proteins from the budding yeast Sacchuromyees cerevi- 
siae and their homologue from fission yeast Schizosaccharomy- 
ces pombe (probability of the similarity being observed by 
chance <10"'^). All of these proteins are approximately the 
same size and contain a C-terminal PHD fiunger domain (20, 
21). The sequence conservation in the PHD domain in these six 
protein sequences is striking. There are 13 invariant residues 
in addition to the eight metal-chelating cysteines and histi- 
dines that are conserved in all PHD fingers (Fig. 4B). Multiple 
alignment analysis resulted in the delineation of an additional 
N-terminal region that is conser\^ed in these proteins (Fig. 4B). 
This region consists of approximately 100 amino acid residues, 
includes two distinct conserved motifs, and shows a fairly sub- 
tle similarity that was not statistically significant in the con- 
text of the screening of the complete data base, except for the 
conservation between the two human paralogs. However, in the 
reduced search space defined b}^ the presence of the PHD fin- 
ger, it was shown that the probability of finding this level of 
similarity by chance was less than 10"^^ for the distal motif 
and 10"^ for the proximal motif 

Expression of ingl Transcripts in Adult and. Embryonic 
Mouse Tissues — We analyzed the expression of ingl mRNA in 
the organs of adult mice and mouse embryos at different stages 
of development by Northern blot hj^bridization. The probe rep- 
riesenting the common exon of the ingl gene revealed multiple 
mRNA size classes that represent alternative transcripts of 
ingl (Fig. 5A). Overall ingl mRNA expression is most abun- 
dant in thjTnus and testis. Much lower levels were detected in 
the rest of the tissues tested, and they also differed in the 
content and relative intensity of the hybridizing bands. The 
same pattern of ingl mRNA expression was observed in p53- 
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C/\CAGTCCCA GCCATAC?U\A GGCGCGAATG 

CCGCTTGCCTA GGCflGGAAGA GGGCGGGACC 

C-1'C'l'GCCCGG GC'J-l"n*CAGA AAGCCC'J AAA 

GAAGCTGAAG GTTGTGCTCC CCTGCGGGAA 

GTGACGAGGA CATGGGCAGC CTCAGA7GCG 

AGAAGAGCTG CTCCCGCCAG CCCGCCCkCC 

CGGGGGCCCC TTTGTCTCCA GGCCATTCTA 

CGA7GGGCCC GSAAGGCGCA GGCGCG7GGA 

TTTGCGTG TC GATAGCTGCC CTCCC GCGCG 

m i *'la 

TCCCCCTC£X_CT1£;CCCCTC GCTCCCCTCC 

TCTCTAGTAC TAGGCGTGCG GGGGGCGTTG 

ArpgSCC GCA GAGGrrAGAG TCGCAAGGGG 

exnn^ ' ituron 

GGGGCCCTAT TTATACCGCG CTAGGAGGCG 

CTCTTCTCAG GTTAGTCGGT TTGAAGGCAG 

TCTCGAGGAG ACAGGCTAGC ACCCCGCGGT 

GTCCGCCACC GAGTGTTAAA CTCCTAGTAA 

GCGACCCCGC CGCTGTCCCC TCGGCGACCC 

CCCGCGCGGG CCGGGCTCGC GGCCGGCCGC 

TGCGGTGGGG GCGGGGCCGC CTCCGGGAGG 
CGGGGCTGAA TGTTTCCCAA GTGTTTGAAfl 

TGCGGCGAGG CGGGCGGCGG AGCGCGCCCT 

CCACGCTGGT GGCGGrG?GG GCGGCGGTGG 

ACGCACGGCC CCCCGCGCCG CCCGCCGCCC 

GCGCTTGAGC TTCGGCCGCC GCGGCCCGCG 

GAGGCGGCGC GGCAGGCGCC AGATGTAGCC 

GCGCGGAAGG GCGAGAGAGC TTTGC:ATTTT 

AGGAAGCGGA AAGCCGCCGA ATCGCCGGGG 

CAACGGGGAG CAGATCCACC TGGtGAACTA 

AC-JGCC'J'TTC GACCTGCAGA GGAACG'ICTC 
AGGTACGGCG 
exon I iairon 



TGGCGACCAC AGGCCAGGCT CCGCCCCr\tX 

gccggggccc gtcgcctagg caacccctcg 
'J'CCGGCAGCT gtgggggaag ggcgggatgg 
gcacac7gcg gtgcccgctg ccaactcagt 
gcagggaggg gaagagctag ttaggatcag 

GTAGATCTCC GCGGTGAAAA GAGCACCCCT 
AACCCAGCAC CGGGAGGCGA CACAAAGGGA 
GGCGCCGAGG ATGCTGGGAG TGGTGGTCCC 
GGTCCGCATC CATCGCCAGG GAGCTGCGCC 



CCCCCCTCCT CCCTCCTCCA ACACCACAGC 
CCGATCCCAC CACACCTCCT TCrCGTCCAfi 
ArTGGCTGAC GCCGTrGCCA TTTCTGTTCG 



GGGGAATGCA GTTAACCCAG CGGAGGGTGG 
CGATCTGGAG GCTCGTCCGG CCCCGGCGTG 
TCCCGGCCGG GGTGCGGCTC CGCCCAGTAG 
AGT7TCGCGT GGCCGGTCTC CCCTGCCCAG 
TGGCCCCCAG CGACAATGAC AGGCCACGCC 
CCCCCGGCCC GGGACGGTGA GGGGCGTGAA 
AGGGGTGGCG GAGCGCATGC GCGC7GCGCG 
CTGGTATTTG GGTTT7CCAC GTTGGACAAG 
TCCCGCAGCC GGCGCCGCTC TCTCCGCTCT 
CCGGCCCTCT TTGGG7GTGT GCGCCCTAGT 
GGGAGGGGGC CTGCACGGCC GGCGGGGTGT 
CCTGCAGGCG CCGGGGTCCC CGCGGCCCCG 
GCCGGCCAGC CCGGCCGGAG CGGCGGGGGG 
GCAGTGCTGT TTGAGGGGGG CGGGGGGTGG 
ACC7CCGGGG TGAACCATGT TGAGTCCTGC 
'J'GTGGAGGA'l' TACCTGGAC'l' CAA'l'CGAGTC 
GC I*GA I GCGG GAGATCGACG CCAAA TACCA 



ACAGAGTCAT TGGGCGGCGG GCAGTATGGC 
TGTGTTTCCG CTGTCTGCTT TTTTTTTTTT 
AGAGGAGGTG AGTTGAT?TG AATGTCTTCG 
CCTGTCGCrG CTGGGCCGGG CCACCGlCGC 
GGTCTCCGCC CTCAGCGTCTG TCGACTCACA 



GCGCGCTTCC J'GAGTCTCG'J- AGGCtGGaGT 
ATGGCGGCGG TGGCCTCCGG GAGGATGCTG 
GGAGCCTCTG ATCGCTTGTC GCGTTTCCGG 



GGCCCGCriT GCTCGGGACT CGCTCCAGTC 
TTT7T7CCTC TGGTAGAGGG CCGGCCGGAG 
GGTCGCCCGT CCTCTGGCCT TGGGTTGGCT 
TGGCTC'l'GCG CGCTGATTGG J'G'J'ACC'J'CCT 
TAGGC17CGC GCGGGGCGG3 GCAGGGCGOG 

n *' ic 

GGATCGCGGC^CAilxr^CCGG CTGCGAGGCT 
CGCTTACCTT CTCTGCTCTG GCTCCCCGCG 
TAGGCT7GAA TGAGCGGGAG TGCCTCTGTC 



ACTTCCGCCTCGCTCGGCTCCGCCCGCTCGTCCCTCCTGGAAGAGGACAGCTCTCTAGTA 60 

CTAGGCGTGCGGGGGGCGTTGCCGATCCCACCACACCTCCTTCTCGTCCAGATTGGCCGC 120 

AGAG 124 

Exon lb 

GTTAACCC AGC GGAGGGTGGCTCTTCTC AGGTTAGTCGGTTT GAAGGCAGC GATCTGGAG 60 

GCTCGTCCGGC CCCGGCGTGTCTC GAGGAG AC AGGCT AGCACCCCGCGGTTCCCGGCCGG 120 

CGTGCGGCTCCGCCCAGTAGGTCCGCCACCGAGTGTTAAACTCCTAGTAAAGTTTCGCGT 180 

CGCCGGTCTCCCCTGCCC AGGCGAC CCCGC CGCTGTCCCCTCGGCGACCCTGGCCCCCAG 2 40 

CGACAATGACAGGCCACGCCCCCGCGCGGGCCGGGCTCGCGGCCCGCCGCCCCCCGGCCC 300 

GGGACGGTGAGGGGCGTGAATGCGGTGGGGGCGGGGCCGCCTCCGGGAGGAGGGGTGGCG 3 60 

GAGCGCATGCGCGC7GCGCGCGGGGCTGAATGTTTCCCAAGTGTTTGAAACTGGTATTTG 420 

GGTT7TCCACGrTGGACAAGTGCGGCGAGGCGGGCGGCGGAGCGCGCCCTTCCCGCAGCC 480 

GGCCCCGCTCTCTCCGCTCTCCACGCTGGTGGCGGTGTGGGCGGCGGTGGCCGGCCCTCT 540 

TTGGGTGTGTGCGCCCTAGTACGCACGGCCCCCCGCGCCGCCCGCCGCCCGGGAGGGGGC 600 

CTGCACGGCCGGCGGGGTGTGCGCTTGAGCrrCGGCCGCCGCGGCCCGCGCCTGCAGGCG 660 

CCGGGGTCCCCGCGGCCCCGGACGCGGCGCGGCAGGCGCCAGATGTAGCCGCCGGCCAGC 720 

CCGGCCGGAGCGGCGGGGGGGCGCGGAAGGGCGAGACAGCTTTGCATTTTGCAGTGCTGT 780 

TTGAGGGGGGCGGGGGGTGGAGGAAGCGGAAAGCCGCCGAATCGCCGGGGACCTCCGGGG 840 

TGAAC CATGT TGAGTCCTGCCAACGGGGAGCAGATCCACCTGGTGAACTATGTGGAGGAT 900 
MLSPANGEQIHLVHYVED 

TACCTGGACTCAATCGAGTCACTGCCTTTCGACCTGCAGAGGAACGTCTCGCTGATGCGG 960 
XLDSIESLPFDLQRHVSLMR 

GAGATCGACGCCAAATACCAAG 982 
E I D A K Y Q 



CACTTTCCGGCTGCGAGGCTATGGCGGCGGTGGCCTCCGGGAGGATGCTGCGCTTACCTT 60 
CTCTGCTCTGGCTCCCCGCGGGAGCCTCTGATCGCTTGrCGCGTTTCCG 10 9 

Caamon exon 

AGATCCTGAAGGAGCTGGACGACTACTATGAGAAGTTCAAACGGGAGACAGACGGCACCC 60 
EILKELDDYYEKFRRETDCT 

AGAAGCGCCGGGTACTGCACTGCATCCAGAGGGCCCTGATCCGCAGCCAGGAGCTAGGCG 120 
QKRKVLBCIQRALIRSQELG 

ATGAGAAGATCCAGATCGTGAGTCA GATGG TGGAGCTGGTGGAGAACCGCAGCAGACAGG 180 
DBKIQIVSQMV&LVSNRSRQ 

TGGACAGTCACGTGGAGCTCTTCGAAGCACACCAGGACATCAGTGACGGCACTGGTGGCA 240 
VDSBVELFEAEQDISDGTGG 

GCGGCAAGGCGGGCCAGGACAAGTCGAAGAGTGAGGCCATCACACAGGCAGATAAGCCGA 300 
SGR.AGQDKSKSEAITQADKP 

ATAACAAGCGGTCCAGGAGGCAGCGAAACAATGAGAATCGAGAGAACGCGTCGAAT AATC 3 60 
NNKRSRRQRNNENRENASHN 



ACGACCATGATGACATCACCTCAGGAACGCCCAAGGAGAAGAAAGCAAAAACCTCAAAGA 420 
BDHDDITSGTPKEKKIKTSK 

AGAAGAAACGCTCCAAGGCCAAAGCAGAGAGGGAAGCGTCTCCTGCCGACCTTCCCATCG 480 
RKKRSrAKAERE ASPADLPI 

AC CCCAACGAGCCCACGT ACTGTCTGTGC AACCAGGTC TCCT ACGGGGAGATGATCGGCT 540 
DPNEPTTCLCHQVSTGEMIG 

GTGACAACGACGAATGCCCCATCGAGTGGTTCCACTTCTCCTGCGTGGGGCTCAACCATA 600 
CDNDECPIEWFHFSCVGLNB 

AACCAAAGGGCAAGTGGTACTGCCCCAAGTGCCGTGGGGAGAGCGAGAAGACCATGGACA 660 
KPKGKWyCPKCRGESEKTMD 

AAGCCCTGGAGAAGTCCAAGAAAGAGAGGGCTTACAACAGGrAGrGAGTGGACACTCACC 720 



KALE 



R A Y K R 



CGTGGTCAGTGACACAGCCACCAGTGTGTTTATGGTATCGCTGCCTTCGTGGAAGTCCGA 780 
GGGCAGTCAGATGAGTATTTTAGAGAATGTT AGCCGTGCCTCCTCTCCTCGGATGGCGGA 8 40 
GACAGCCTGCGTCTTCATGGTGTACCTTGTTCCCAGACTGGG7CTGCAGAGCCGCAATTT 900 
AGAAACTACAAArACAGGTTCGAATTAACCATGTCAGCGAGTGTCAGACTGATTTTGGGG 960 
GTTGGGGAGTGACTTGGAAGTAAGCTAACAGCTGTATAAAGAAGAGATTTC ATTTGGCTG 10 20 
TTTTGACAAAAATGTTACGGTTTACAAAAAAAAAAAAAAAAATTAGCCAGGTTGCTGAAG 1080 
TACAACAAGCGTGCTTCTTGCT ACCATAATGTAT ATCC ACACGACAAGTTGGTGGTCTAA 11 40 
AGTC7AAATATTATTATrTTTTAAAGAGGTAAATGGGTAAATTTTACATGACATATTTTA 1200 
TACATGGCCTATTCCCTAACTGGCCATTTTTAATGACTGGGTACATTTTTAATAGGTCAG 1260 
ACAAAAGTGGTCCAGGCAGTGGGTCCCAGTCTTGCCTTTGCTACCCTAGGTCATGGTGTA 1320 
GCCACCTTTAACTTATATGAAGTGTATAAATGTACATC'rrGCCCCTGCTGTATCA7AACC 1380 
GGAAGTGCAGCCTGGTGCTGTTATGAAGCCAAGGTGTGCATCCTGCTGCGTGTGTGAGCT 1440 
GrATAGATGTTGCGTCAAGAAATAAATGAAACTTGGCCAGTTTCTTTCTTTAGCATTTAA 1500 
TTTTAACATACAAATAATTTTAAATTTTTGTCTTAAAAATTTATACACCAGCAGTTTAGA 1560 
CAAAGCCTTAAGCAAATTGTATATTATTGATCTCACGTAATAAGGAAGTAGGCATTATGT 1620 
CCATGCCAGCAAATACATGTCAAAGGTGTTTAGACCAGAGGTTAGGACGTGACTGTGCTT 1680 
GTGAGGTCCAGCCCCACCATCTGTTTT7GTATGGCCTGTGAGCTAAGAATGGTCTCTATG 17 40 
TTTCTGAG7GGTrTTTAAAATCACATT?CATGATACACGAAAGTCCTATGAAATGAAGAT 1800 
TTGCGTGTCTATAAATAAAGACTCATTGGAACACC 1835 



Fig. 3. Sequences of the putative promoter areas of the ingl gene, ingl cDNAs, and their predicted protein products. A, sequence 
of the genomic region with the putative promoter areas of cDNA isoforms la and lb; the start sites of transcription of these two variants are marked 
by arrows (the start site of isoform lb was estimated based on the size of the 5 '-RACE product and the analysis of genomic sequence). The initiator 
sequence of isoform la, including multiple initiation start sites, is underlined; TATA-like sequence for isoform lb is shown in boldface type. The 
underlined boldface italic sequence is the CAAT box positioned about 100 base pairs upstream from the lb transcript initiation site. Spl binding 
sites are displayed in boldface italic type, B, sequence of the putative promoter area of isoform Ic, with the underlined initiator sequence 
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mouse KLSPAKGEQIHLVNYVEDYII)SIESLPFDLQBNVSLMRCIOAKYQEXLKELDDYYEKFKR 60 

human MLSPANGEQLHLVNYVEDYLDSIESLPFDLQRNVSLMREIDAKYQEILKELDECYERFSR 60 

mouse ETDGTQKRRVUICIQRALII^QEI^EKIQIVSQMVELVENRSRQVDSHVELFEAHQDIS 120(26) 

h um a n ET DGAQKRRMLH CVQRAL IRSQELGDEKIQIV SQMVELVEN RTRQV D S H VE L FE AQOE LG 120 

mouse D<n'GGSGKAGQDKSKSEAITQJU>KFNNKIU5RRQRNNENR£NASNNHDHDDITSGTPKEKK 180(86) 

human DTVGNSGKVGADRPNGDAVAQSDKPNSKRSRRORNNENRENASSNHDHDDGASGTPKEKK 1 SO 

mouse lECTSKtOaCRSKAKAEREASPADLPIDPNEPTYCIXSIQVSYGEMIGCDNDECPIEWFHFSC 240(146) 

human AKTSKKKKRSKAKAEREASPADLPrDPNEPTYCLCNQVSYGEMIGCDNDECPIEWFHFSC 24 0 

mouse VGLiniKPKGEWYCPKCItGESEKTKDKALEKSKKERAYNK 279(185} 

human VGLNHKPKGKWYCPKCRGENEKTMDKALEKSKKERAYNR 279 



B 



Nous* ingl (lb) 

P33IHGl_Hs_2829208 

P33INGlp_HS_3041B55 

yNL097c_Sc_2131886 

yOR0S4c_Sc_lC20209 

yKR090c_Sc_62 6641 

SPAC3G9_Sp_2706459 

Consensusl 

Hous* inoKlto) 

P33INGl_Hs_2829208 
P33INGlp_Hs_3041855 
YNL097c_Sc_21 31886 
YORO 6 4c_Sc_l 4 20209 
YHRO 9 Oc_Sc_6 26641 
SPAC3G9_Sp_2706459 
Consensusl 

YPL138c_SC_1244777 

CllG6.3_Ce_1229037 

SPBC17Dll_Sp_365040 3 

Mi-2_Hs_1585696 

CHD3_Hs_2645433 

K1AA0333_H5_2224507 

KlAA0215_Hs_1504012 

F33E11.3_Ce_315B49e 

Consensus^ 



22 lESLPFDLQRNVSUIREinXKyQEILKELDD 
22 IBSLPrDLQRWVSLMREIUAKyQEILKELDE 
? IBQLPMDLRDRrTEMREMDLOVOMAMDCLEO 
17 LEEPPLATSRYLTLLHEIDAKCVHSMPNLNE 
20 IDHLPCELIRSLRLMOTIDLFKNEEDEPGME 
13 VSSLPSEFRYLLEEIGSNDLKLIEEKKKYEO 
15 U>BIVPHETKHirDElSV?:EVAVHDIWi;RIOA 
hppbP b b -hp 

NEPTlfS-LftjQVSY- - -OEMIi 
KB PTxfl - r^MQ VSY - - - GEMI ( 
KEPRirffl-iaiQVSy- - -OEMVC 

GE PLYS - YgNQVAY GEMVC 

OEEVtB-FSRNVSY GPMVJ 

DKTLYS - FSORVSr CEHV; 

DHEKYg - rSoQGSY O 

p TC hCpp so G HbsCDs 



13 XRRVLHCIQRALTRSOELGDEKIOIVSQMVELVEHRSRavOSHVEL 67 

13 KRRKLHCVDRAI.IRSQEliGDEKI0IV6QMVELVEimTR0VDSHVEL 67 
16 REEOMASIKKDTYKALEDADEKVOLAJOIYDLVDRHLRKLOOELAK 75 
16 CVRLLNNINKIYEKLMPSLEKKMHVSSIMLDNLDR1.TSRLELAYEV 67 
29 ELE10KSVTKMFNSSLENIKSKLTI.EKPGAyKEPKLLLRiaLKKAK 41 

14 EDGLDKE1KESL,LKCQSL0REKCVI.AMTALFLIAHHLNKI^KNIAL 73 
14 EDALYSTIREEYOKAINIOMEKVQLADRARLGLTRHIKRLDDRIAK 73 

P bpp b p ppKb bp p pbp 
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Fig. 4. Protein sequence analysis of p37^^^. A, alignment of amino acid sequences of mouse ingl lb isoform and its human ortholog. 
Numbers of amino acids are indicated; numbers in parenthesis indicate the size of the truncated protein product of la and Ic ingl isoforms. The 
underlined methionine in the mouse sequence indicates the beginning of the protein product encoded by isoforms la and Ic. The asterisks indicate 
identical amino acids, while dots indicate conserved changes in amino acid sequence. B, a multiple alignment of p33^^*^^, its yeast homologs, and 
additional PHD finger-containing proteins. The aligned conserved blocks are separated by variable spacers whose lengths are indicated by 
numbers; for the N-terminal block, the distance to the protein N termini is indicated (the sequence of the human paralog of pSS^^*^^ is incomplete). 
Consensusl shows amino acid conservation in the p33^^*^^ family of proteins; Consensus2 shows the conservation in an expanded set of PHD finger 
proteins (in addition to the six proteins of the p33^^^^ family, the sequences that aligned with p33^^*^^ with a probabihty of a random match below 
10"** in the first iteration of the PSI-BLAST analysis were included). Each consensus shows amino acid residues conserved in all sequences of the 
respective set; h indicates a hydrophobic residue, p indicates a polar residue, - indicates a negatively charged residue, s indicates a small residue, 
and a indicates an aromatic residue. The metal -chelating residues in the PHD finger domain are shown by white type on black. Each protein is 
identified by a gene name followed by species abbreviation and the gene identification number fi-om the nonredundant protein data base at the 
NCBI. p33^^*^^p is the human paralog of p33^^*'^ Mi-2 is a human autoantigen, and CHD3 is a chromatin -associated helicase of the SNF2 family; 
the remaining proteins are uncharacterized gene products designated by their systematic gene names. Species abbreviations are as follows. Hs, 
Homo sapiens; 5c, Saccharomyces cerevisiae; Sp, Schizosaccharomyces pombe; Ce, Caenorhabditis elegans. 



deficient mice, which showed no direct effect by p53 on iiigl 
regulation (data not shown). In embryos, the highest expres- 
sion was found on the 11th day of development and was char- 
acterized by changes in the relative expression of different 
classes of mKNA transcripts. 

Patterns of ingl expression in embrj^ogenesis were also 
checked by in situ hybridization. We used the histoblotting 
technique, in which embryonic sections were fixed on a nitro- 
cellulose membrane and then hybridized with the RNA probes 
corresponding to the common part of the ingl cDNA. Besides 
sense and antisense in^i -specific RNA probes, we also used an 
antisense probe for jS-actin, a gene with ubiquitous expression, 
as an internal standard. Sections were prepared from 10-, 12-, 
16-, and 18-day embryos. Results of in situ hybridization are 
shown in Fig. 5C. ingl is uniformly expressed in the whole 
mouse embryo at all stages of development examined. This is in 



agreement with the result obtained from the adult mouse tis- 
sues, where ingl is expressed ubiquitously and at similar levels 
in all organs analyzed. However, in day 10 embryos, higher 
expression levels were observed in the yolk sac, while at day 16 
and 18 of development, higher levels of expression were de- 
tected in inner compartments of bones and probably match 
areas of ongoing ossification. 

In order to determine patterns of expression of different ingl 
transcripts in mouse tissues and during embryogenesis, North- 
em blots were hybridized vnth probes corresponding to the 
alternative start sites of ingl (Fig. 5, A andB). Isoforms lb and 
Ic were expressed ubiquitously in all adult mouse tissues 
tested, with the highest levels in thymus. Both isoforms were 
expressed in the embryos at all stages of development ana- 
lyzed, with the highest levels at day 7 in the case of isoform Ic 
or day 11 in the case of isoform lb. Out of all adult tissues 



overlapping the two transcription start sites. Multiple Spl binding sites are also indicated. C, sequences of ingl cDNAs and their predicted protein 
products. Sequences of the alternative la, lb, and Ic first exons as well as the common ingl exon are shown. The first ATG codon of isoform lb 
is underlined as well as the ATG codon in the common exon that is used as the initiation codon for la and Ic translation. Stop codons, indicating 
the end of translation, are also underlined. 
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Fig. 5. ingl expression in adult mouse tissues and embryos. A, results of Northern hybridization of total mouse RNAs isolated from the 
indicated organs with the indicated probes; a photograph of an ethidium bromide-stained gel is shown as a loading control. B, Northern 
hybridization of RNA on a CLONTECH mouse embryonic multiple-tissue Northern blot (mRNA samples from mouse embryos at 7, 11, 15, and 17 
days of development) with the same probes. The arrows indicate positions of the 2,37-kilobase RNA marker as well as the positions of 28 and 18 
S rRNAs. C, in^l expression in mouse embryos analyzed by in situ histoblot hybridization. Histoblots were prepared from whole body sections of 
10-, 12-, 16-, and 18-day embryos and hybridized with the sense and antisense m^i -specific ^^P-labeled RNA probes corresponding to the region 
encoding PHD domain. Antisense probe for ubiquitously expressed actin gene was used as a control. 



analyzed, mRNA for isoform la was expressed only in testis 
and also in the 11-day embryo. While there is no detectable 
signal with the la-specific probe in the 7-day embryo, traces of 
hybridization could be detected in mRNA from day 15 and day 



17 embryos, which indicates an extremely low expression level 
at these points of mouse embryogenesis. 

Indications of Proliferation-dependent Regulation of ingl Ex- 
pression — To check whether ingl expression was proliferation- 
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A293 NMuMG 10(1) 

Fig. 6. ingl mKNA and protein expression varies depending 
on cell growth conditions. A, RNA was isolated from dividing 10(1) 
{lane a) and NMuMG {lane e) cell lines, as well as from the dividing 
young mouse embryonic fibroblasts {lane h). RNA was also isolated 
from contact-inhibited 10(1) {lane b) and NMuMG cells {lane f)\ serum- 
starved 10(1) cells {lane c) (64 h at 0.5% FBS); -y-irradiated 10(1) cells 
{lane d) and NMuMG cells {lane g); and senescent MEFs {lane i). Filters 
were probed with total ingl probe. The bottom panels show RNA gels 
from which Northern blots were made. The arrows indicate positions of 
the 2.37-kilobase RNA marker {upper panels) or 28 and 18 S rRNAs. B, 
detection of m^i -encoded proteins by Western immunoblotting, using 
antibodies against p33^^^''. Cell lysates were prepared using radioim- 
mune precipitation buffer from the cells treated in the same way as 
explained above. As a control, cell lysates were prepared from 293 cells 
transfected with the plasmids expressing mouse lb irigl variant {lane a) 
or the human homologue of p31"*^ that produces the protein product of 
the same size as mouse Ic and la variants. 

dependent, we analyzed RNA from two niouse cell lines, 
NMuMG and 10(1), at different growth conditions by Northern 
blot hybridization using total ingl cDNA as a probe, ijigl 
expression was also compared in senescent versus young, di- 
viding mouse embryonic fibroblasts. Results are shown in Fig. 
6A. In both cell lines, ingl was expressed at higher levels in 
dividing compared with quiescent cells (quiescence is induced 
by serum star\'ation, contact inhibition, or y-irradiation). How- 
ever, in 10(1) cells the difference in expression was specific only 
for the upper ingl specific band, which corresponds to isoform 
lb. In mouse embryonic fibroblasts, ingl was expressed at very 
low levels without any differences between di\^ding and senes- 
cent cells. 

ingl expression in cell lines was also analyzed by Western 
blotting using an IgGl mouse monoclonal antibody produced 
against human recombinant pSS^'^^^ (16) (Fig. This anti- 
body has previouslj^ been shown to specifically detect the de- 
natured form of mouse p33^^'^^ protein in Western immunoblot 
protocols. Cell lysates were produced from dividing, contact- 
inhibited, serum-starved, or y-irradiated 10(1) and NMuMG 
cells. In both cell lines, two protein products were detected, 31 
and 37 kDa in size, which correspond to the truncated INGl 
and lb itigl protein products, respectively. The 37-kDa protein 
was present at the same levels in dividing and quiescent cells, 
while the 31-kDa protein was present at higher levels in divid- 
ing compared with nondividing cells. The variation of expres- 
sion of ingl differs significantly at mRNA and protein levels, 
indicating that this gene could be regulated at the level of 
transcription or protein stability. 

Overexpression of pST"^^^ Inhibits p53 Function — ^All previ- 
ous functional analysis of INGl was done using human cDNA 
expressing the shorter protein encoded by this gene (1, 2, 4). As 
a result, p33^^'^^ was defined as a negative growth regulator 



that cooperates with p53 in transcriptional activation of p53- 
responsive genes. Its mouse homologue, p31"^^, has a similar 
biological effect (data not shown) that is consistent with the 
high degree of identity between the mouse and the human 
proteins (see Fig, 4). To analyze the influence of . the longer 
product of ingl on p53 function, we overexpressed cDNA for 
p37i/i^i in mouse cells maintaining wild type p53. To effectively 
monitor p53-dependent transcriptional activation, we used pre- 
\nously characterized ConA cells (see "Materials and Meth- 
ods"), carrying the p53-responsive bacterial lacZ gene. We 
tramsduced ConA cells with retrovirus expressing pS?*"^-^ and 
enhanced green fluorescent protein from a bicistronic mRNA 
and generated cell populations with different levels of overex- 
pression of p37*"^^ by sorting cells differing in the levels of 
enhanced green fluorescent protein fluorescence. These cells 
were compared with ConA cells transduced with either the 
empty pLXIG vector or with retroviral vector expressing the 
short protein product of ingl. The expression of introduced 
constructs was confirmed by Western immunoblotting using an 
anti-p33^^'^^ antibody that recognizes both human and mouse 
proteins (16) (Fig. IB), p53-dependent expression of lacZ was 
determined bj^ X-gal staining of ConA cells treated with UV 
hght (Fig. 7A). The expression of p53 protein and p53-respon- 
sive p21"^"^^ protein in untreated and in UV-irradiated cells 
was estimated by Western inamimoblotting (Fig. IB). A de- 
creased intensity of the p21 band by Western blot and a weaker 
X-gal staining of ConA cells treated with Hght indicate that 
^^rjingi clearly inhibited UV-induced accumulation of both p53- 
inducible proteins. Moreover, overexpression of p37^'^^^ 
strongly attenuated accumulation of p53 in response to UV 
hght (Fig. IBh These observations demonstrate that p37^"^^ 
has a suppressive effect on p53-dependent transcriptional ac- 
tivation supposedly by attenuating p53 accumulation in re- 
sponse to DNA damage. This means that the two proteins 
encoded by the ingl gene have opposite effects on p53 function. 

It was previously shown that p33^^^^^ can be detected in cells 
in a complex with p53 by co-immunoprecipitation (4). Since at 
that time it was not known that INGl encodes two different 
proteins, it was unclear which of the two products of INGl was 
detected in these experiments. We therefore compared the abil- 
ity of short and long ZA^Gi -encoded proteins to form a complex 
with p53 by using immxmoprecipitation. Since the protocol of 
immunoprecipitation was optimized for human proteins, we 
analyzed human variants of INGl proteins expressed in hu- 
man cells by transient transfection. p53-deficient Saos-2 cells 
were used in these experiments that have both alleles of the 
p53 gene deleted and presumably retain an intact p53 path- 
way. They were co-transfected in different combinations with 
the expression plasmids encoding wild type p53 and the short 
or long products of INGl. Cell lysates were treated with anti- 
p53 antibodies (see "Materials and Methods"), and the presence 
of ZATGi -encoded proteins in precipitates was detected by West- 
em immunoblotting (Fig. 7C). The results obtained showed 
that although p37^^'^^ is co-precipitated wath p53, the shorter 
INGl product is not detectable in the p53 precipitate. This 
observation suggests that differences in the effect of the two 
INGl products on p53 function could be a reflection of differ- 
ences in their interaction with p53 in the cell. 

DISCUSSION 

Multiple Products of the ingl Gene — INGl was originally 
described as a gene whose suppression promotes neoplastic 
transformation (1). Consistent with that, ectopic expression of 
the first isolated human cDNA was growth-suppressive for 
different cell lines. INGl was found to encode a nuclear protein 
termed p33^-'^*^^ that was shown to functionally and physically 
interact with p53 in cell growth regulation (4, 22). Now, after 
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Fig. 7. p37*"*'' affects p53 function. A, dependence of 3-galactosidase activation after UV irradiation of ConA cells on the expression levels of 
pgy.n^j Qqj^\ cgiig expressing the human homologue of p31*"*^ {lane a); p3T"^^ at low (lane c), medium (lane d), or high (lane e) levels; and ConA 
cells infected with pLXIG vector alone (lane b) were equally plated in a 12-well plate, UV-irradiated (25 J/m^), and X-gal-stained 16 h after the 
treatment. A fragment of the 12-well plate and microscopic views of individual wells are shown. B, expression of p53 protein, p53-responsive p21"^^ 
protein, and in^l protein products in untreated and in UV-irradiated ConA cells was estimated by Western immunoblotting. Analyzed cell lysates 
were isolated using radioimmune precipitation buffer from the cells treated in the same way as explained for A. C, coimmunoprecipitation of p53 
and INGl protein products in Saos-2 cells. Detection of p53 (upper panel), p37^^^^ (middle panel), and p31^""^*^^ (lower panel) proteins by Western 
immunoblotting is shown. Lanes f, g, and i (top and middle panels), lysates from Saos-2 cells co-transfected with p53 and pS?^*^ '-expressing 
plasmids. Lanes f,g, and i (bottom panel), lysates from Saos-2 cells co-transfected with p53 and p31^^^' -expressing plasmids. Lane h, lysate from 
control nontransfected cells. Lanes f zndg contain the products of immunoprecipitation with anti-p53 antibodies (lane g) and control nonspecific 
antibodies (lane f). 



we characterized the structxire and expression of the mouse 
ingl gene, it became clear that ingl regulation is more compli- 
cated than was originally thought, ingl is transcribed from at 
least three differently regulated promoters, and the resulting 
transcripts encode at least two different proteins. 

A similar type of regulation (alternative initiation leading to 
variability of 5'-exons) found in other tumor suppressor genes, 
namely BRCAl , APC, and INK4 (23-26), is associated with the 
generation of proteins with different functions (in the case of 
INK4, different reading frames are translated in the alterna- 
tive transcripts). Consistently, the two products of ingl have 
opposite effects on p53-dependent transcription regulation; one 
acts as a p53 cooperator, while the other acts as a p53 
suppressor. 

All of the transcripts share a common region encoded by a 
common axon but differ in their 5'-exons, Two of these alter- 
native exons do not contain protein-coding sequences (isoforms 
la and Ic), while the third one does (isoform lb). Consistentlj% 
one of the ingl transcripts encodes a 37-kDa protein (p37^^'^^), 
while two others are translated into a shorter protein of 24 kDa 
that surprisingly nins as if it was 31 kDa (p31^^^^). Structures 
of mouse and human ingl genes are likely to be similar, con- 
sidering the high degree of evolutionary conservation of ingl 
sequences and the fact that the tw^o versions of human INGl 
that are currently available in GenBank™ also share a large 
common part but have different N termini. It is noteworthy 
that one of the human variants of INGl (GenBank™ accession 
number AF001954) does not have a homologue with similar 
5'-sequences among the identified mouse ingl isoforms, which 
raises the possibility that there could be more variants of 
mouse ingl transcripts that so far have not been isolated. 

^^^iNGi homologs in such a distantly related species 



as yeast contain a remarkably conserved PHD finger domain 
and an additional, weakly conserved domain of xmknow^n fimc- 
tion. PHD fingers have been shown to bind DNA (21, 27), but to 
our knowledge there is no evidence that they mediate protein- 
protein interactions. Therefore, it seems likely that the C- 
terminal PHD domain in the pS3^^^ family of proteins is in- 
volved in specific DNA binding that may be important for 
transcription regulation. Given the outstanding conservation of 
this domain across the large phyiogenetic distance that sepa- 
rates humans and yeast, the specificity of DNA binding with 
respect to particular binding sites is expected to be conserved 
either in terms of DNA sequence, distinct features of chromatin 
structure, or both. It may be further surmised that the con- 
served N-terminal domain is involved in specific protein-pro- 
tein interactions that couple transcription regulation by 
pgg/A^Gj gj^^ homologs to other elements of cell cycle control. 
The conservation of domain organization in p33^^'^^ and its 
yeast homologs is particularly notable given that yeast does not 
encode any homologs of p53. Thus, it appears that p33^^^^ is an 
ancient cell cycle regulator whose interaction with p53 is a 
later evolutionary addition. 

Regulation of ingl Expression — The results of expression 
analysis indicated that mouse ingl is a subject of regulation 
both at the protein and mRNA levels and that different iso- 
forms have different expression patterns. Thus, p37^^^^ is 
ubiquitously expressed in all tissues analyzed with elevated 
mRNA expression levels in the thymus, while levels of the 
p31^^^^^ protein vary dramatically among organs and cell lines. 

Sizes of the ingl mRNA transcripts detected by Northern 
hybridization correlate well with the length of cDNA sequences 
isolated from all of the tissues except testis. Probes specific for 
each of tlie identified splice variants reveal multiple RNA 
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bands, indicating a variability of the ingl transcripts in this 
organ. It is likely that in testis ingl mRNAs are either termi- 
nated or processed differently from the rest of the tissues. 

There is a clear link between the proliferation rate of the cell 
and ingl expression both at the protein and mRNA levels. This 
observation may reflect cell cycle dependence of ingl transcrip- 
tion detected earlier for the human INGl gene; the biological 
significance of this regulation is not obvious so far. We failed to 
observe up-regulation of mouse ingl in senescent cells as it has 
been reported for human INGl (8), which suggests that regu- 
lation patterns of mouse and hiunan ingl genes may not al- 
ways be the same. 

INGl and p53: Cooperators or Antagonists? — Human INGl 
has been knowTi as a cooperator of p53 involved in negative 
growth regulation (1, 4), apoptosis (3), and senescence (2). This 
set of properties, together with the obsen^ations that show a 
decreased expression of INGl in several breast carcinoma cell 
lines and INGl gene rearrangement in one neuroblastoma cell 
hne, allow INGl to be defined as a candidate tumor suppressor 
(1-4, 16, 22). Here we demonstrate that INGl encodes, in 
addition to the protein with the above described properties, 
another product that shows an opposite effect on p53 function, 
inhibiting p53-dependent transcriptional activation. Thus, a 
single INGl gene appeared to simultaneously encode a candi- 
date tumor suppressor (p53 cooperator) and a putative onco- 
gene (p53 inhibitor). This means that a simplified view of the 
potential role of INGl in cancer shoxild be revised. In the case 
of INGl, we may have a new genetic mechanism of promoting 
cancer that involves an imbalance between the two products of 
one gene. Further characterization of the biological activity of 
the long product of INGl, p37^*'^^^, indicated that it can affect 
the whole spectrum of cell properties knovm to be controlled by 
p53, including sensitivity to apoptosis, replicative senescence, 
cooperation with dominant oncogenes, and drug response.^ 
Thus, overexpression of the longer product of INGl, which 
functionally acts as a p53 suppressor, could be the mechanism 
of attenuation of p53 activity in tumors that do not require 
mutations in p53 itself. 

A large scale analysis of chnical samples is needed to con- 
clude whether this hypothesis is correct or not. However, any 
attempts to investigate the potential cancer relevance of INGl 
should involve separate analysis of the alternative isoforms of 
this gene. It has already become obvious that a direct meas- 
urement of the overall mRNA expression in tumors cannot be 
used as an approach to judge the involvement of INGl in 
carcinogenesis. The ratio between expression levels of the two 
INGl products seems to provide much more meaningful infor- 
mation. It is therefore essential to establish such assays that 
would distinguish between the two protein products or among 
the alternative INGl transcripts. 

Although the mechanism of p53 modulation by the two prod- 
ucts of the INGl gene remains unknown, we hope that the two 
new observations described in this work will shed light on this 
problem. We found that (i) p53 is found in a complex with the 
long but not with the short product of INGl and (ii) overex- 
pression of the longer product inhibits accumulation of p53 
protein after DNA damage. Both properties of p37^^'^^ clearly 



^ L A. Grigorian, K. V. Gurova, M. Zeremski, S. S. Kwek, and A. V. 
Gudkov, manuscript in preparation. 



resemble those of the natural p53 antagonist, Mdm2, which 
was shown to affect nuclear localization of p53 (28, 29) and to 
promote its proteasomal degradation (30, 31). All of these pos- 
sibilities should be tested for p37^^*^^ as well as a potential 
interdependence of the biological activity of these two proteins. 

Is the function of INGl limited to its interaction with the p53 
signaling pathway? It seems doubtful, considering the high 
evolutionary conservation of INGl. In fact, INGl homologues 
found in yeast (Fig. 4), which does not have detectable p55- 
related genes, suggest some important p53-independent role 
for INGl. The phenotj^es associated with the knockout of 
INGl in yeast and mice may provide a lead toward its currently 
\mknown cellular function. It is noteworthy that a gene knock- 
out approach will be complicated by the necessity to separately 
inactivate expression of each of the two products of INGL 
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